AITopics | normal barrier

Collaborating Authors

normal barrier

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

b2ea5e977c5fc1ccfa74171a9723dd61-Paper.pdf

Neural Information Processing SystemsFeb-9-2026, 21:31:16 GMT

adversary, algorithm, bandit, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.14)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

b2ea5e977c5fc1ccfa74171a9723dd61-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-9-2026, 21:31:04 GMT

normal barrier, reviewer, valuable comment, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.35)

Add feedback

Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs

Neural Information Processing SystemsAug-15-2025, 21:09:08 GMT

Besides its simplicity, our approach enjoys several advantages.

adversary, algorithm, bandit, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.14)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

b2ea5e977c5fc1ccfa74171a9723dd61-AuthorFeedback.pdf

Neural Information Processing SystemsAug-15-2025, 21:08:57 GMT

normal barrier, reviewer, valuable comment, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.35)

Add feedback

Adaptive Bandit Convex Optimization with Heterogeneous Curvature

Luo, Haipeng, Zhang, Mengxiao, Zhao, Peng

arXiv.org Machine LearningFeb-12-2022

We consider the problem of adversarial bandit convex optimization, that is, online learning over a sequence of arbitrary convex loss functions with only one function evaluation for each of them. While all previous works assume known and homogeneous curvature on these loss functions, we study a heterogeneous setting where each function has its own curvature that is only revealed after the learner makes a decision. We develop an efficient algorithm that is able to adapt to the curvature on the fly. Specifically, our algorithm not only recovers or \emph{even improves} existing results for several homogeneous settings, but also leads to surprising results for some heterogeneous settings -- for example, while Hazan and Levy (2014) showed that $\widetilde{O}(d^{3/2}\sqrt{T})$ regret is achievable for a sequence of $T$ smooth and strongly convex $d$-dimensional functions, our algorithm reveals that the same is achievable even if $T^{3/4}$ of them are not strongly convex, and sometimes even if a constant fraction of them are not strongly convex. Our approach is inspired by the framework of Bartlett et al. (2007) who studied a similar heterogeneous setting but with stronger gradient feedback. Extending their framework to the bandit feedback setting requires novel ideas such as lifting the feasible domain and using a logarithmically homogeneous self-concordant barrier regularizer.

algorithm, optimization, strong convexity, (13 more...)

arXiv.org Machine Learning

2202.0615

Country:

North America > United States > California (0.14)
Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report (1.00)

Industry: Education > Educational Setting (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs

Lee, Chung-Wei, Luo, Haipeng, Wei, Chen-Yu, Zhang, Mengxiao

arXiv.org Machine LearningOct-29-2020

We develop a new approach to obtaining high probability regret bounds for online learning with bandit feedback against an adaptive adversary. While existing approaches all require carefully constructing optimistic and biased loss estimators, our approach uses standard unbiased estimators and relies on a simple increasing learning rate schedule, together with the help of logarithmically homogeneous self-concordant barriers and a strengthened Freedman's inequality. Besides its simplicity, our approach enjoys several advantages. First, the obtained high-probability regret bounds are data-dependent and could be much smaller than the worst-case bounds, which resolves an open problem asked by Neu (2015). Second, resolving another open problem of Bartlett et al. (2008) and Abernethy and Rakhlin (2009), our approach leads to the first general and efficient algorithm with a high-probability regret bound for adversarial linear bandits, while previous methods are either inefficient or only applicable to specific action sets. Finally, our approach can also be applied to learning adversarial Markov Decision Processes and provides the first algorithm with a high-probability small-loss bound for this problem.

algorithm, artificial intelligence, machine learning, (19 more...)

arXiv.org Machine Learning

2006.0804

Country: